Model Selection

Noise robustness

# Noise robustness

Ultravox V0 6 Qwen 3 32b

Ultravox is a large multimodal speech language model capable of understanding and processing speech input, supporting multiple languages and noisy environments.

Transformers Supports Multiple Languages

Ichigo Llama3.1 S Instruct V0.4

A multimodal language model based on Llama-3 architecture, supporting audio and text input understanding with noise robustness and multi-turn dialogue capabilities

Safetensors English

Whisper Small Ita

An Italian-optimized speech recognition model based on OpenAI Whisper-small, enhanced with special tags for improved metadata capture

Speech Recognition

Transformers Supports Multiple Languages

Whisper Medium.en Fine Tuned For ATC

Fine-tuned based on the OpenAI Whisper Medium EN model, specifically optimized for speech recognition of air traffic control communications, with an 84% reduction in word error rate

Speech Recognition

Safetensors English

ByT5 is a tokenizer-free version of Google's T5 that directly processes raw UTF-8 bytes, supporting multilingual text processing with excellent performance on noisy data.

Large Language Model Supports Multiple Languages

ByT5 is a tokenizer-free version of Google's T5 that directly processes UTF-8 byte sequences, supporting multilingual text processing with robustness to noisy data.

Large Language Model Supports Multiple Languages

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase